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SorJal Number: 

CI Changed a file from non-ASCII to ASCII 
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□ 



CRP^^Irs- CdfT^tim^T^& STIC Systb 





□ 



□ 
□ 



□ 



L. 



□ 
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□ 






ranch 

CHF ProcossJng Dato: / 



rocossing uato: / 



Changed the margins in cases where the sequence text was -wrapped.- <^w|^^l^ejd line 
Edited a format error in the Current Application Data section. specilicaJly: 



Edited the Current Application Data section wrth the actual current number. The number inputted by the 
applicant was □ the prior application data; or D other , • 

Added the mandatory heading and subheadings (or "Cun-ent Application Data*. 

Edited the "Number ol Sequences' field. The applicant spelled out a number instead of using an integer. 
Changed tho spelling ol a mandatory field (the headings or-subheadings). specfti«»/ 



Corrocted tho SEQ ID NO when obviously incorrect. Tho sequence gumbers thai were edited were: 



Inserted or corrected a nucleic number at the end ol a nucleic lino. SEQ ID NO's edited: 



Corrected subheading placcmenl. All responses must bo on tho same line as each subheading, 
applicant placed a response below the subheading, this was moved to its appropriate place. 

Inserted colons after headings/subheadings. Headings edited included: 



If \hQ 



Deleted extra, invalid, headings used by an applicant. specificaJly 



Deleted: □ non-ASCII -garbage' at tho beginning/end of files; □ secretary inrtials/filename at end of file: 
□ page numbers throughout text; □ other invalid text, such as. 



Inserted mandatory headings, specifically: 



:z. 



Corrected an obvious error in tho response, specifically: 



Edited identifiers where upper case is used but lower case is required, or vice versa, 



CorrecTod an error in tho Number of Sequences field, specifically: 



A -Hard Pago Break" code was inserted by tho applicant. All occurrences had to bo deleted. 

Deleted ending' stop codon in amino acid sequences and adjusted the -(A)Length:- field accordingly (error 
duo to a Palontin bug). Sequences conected: ^ 

Other: 




^aminer: Tho above corrections must bo communicated to the applicant in the first Office 

Action. DO NOT send a copy of this form. 
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C 
C 



> 
> 



3 
4 
5 
7 
8 
10 
12 
13 
15 
16 
18 
19 
21 
23 
25 
26 
27 
28 
30 
31 
32 
34 
35 
36 
38 
39 
40 
41 
43 
44 
45 
47 
48 
49 
51 
52 
53 
55 
56 
57 
59 
60 
61 
63 
64 
65 
67 



<110> 



<120> 

<130> 
<140> 
<141> 

<150> 
<151> 
<150> 
<151> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



APPLICANT: Madden ^ Mark 
Weiner/ David P.^ 
Chaplin/ Jennifer A.-*' 

TITLE OF INVENTION: METHODS FOR PRODUCING 
ALPHA-SUBSTITUTED CARBOXYLIC ACIDS 
FILE REFERENCE: DIVER1440-2 

CURRENT APPLICATION NUMBER: US/09/7 51,2 99 
CURRENT FILING DATE: 2000-12-29 

PRIOR APPLICATION NUMBER: 60/254,414 

PRIOR FILING DATE: 2000-12-07 

PRIOR APPLICATION NUMBER: 60/173,609 

PRIOR FILING DATE: 1999-12-29 

NUMBER OF SEQ ID NOS : 4 

SOFTWARE: Patentin Ver . 2.1 

SEQ ID NO: 1 

LENGTH: 1041 

TYPE: DNA 

ORGANISM: Unknown Organism 
FEATURE : 

OTHER INE'ORMATION: Description of Unknown 
environmental sample 



<220> 
<221> 
<222> 
<400> 
atg teg 
Met Ser 



FEATURE : 
NAME/KEY 
LOCATION 
SEQUENCE 



1 

ccg 
Pro 

ate 
He 



gtg 
Val 

gag 
Glu 



act 
Thr 



tgg 
Trp 
50 

tgg ggc 
Trp Gly 
65 

ggc age 
Gly Ser 



atg 
Met 



cat 
His 



9^9 
Glu 

ttc 
Phe 

cag 
Gin 
35 
att 
He 

atg 
Met 

aag 
Lys 

gtc 
Val 



ccc 
Pro 

etc 
Leu 
20 
gcg 
Ala 

ccc 
Pro 

egc 
Arg 

cag 
Gin 



CDS 

(1) 

1 

atg 
Met 
5 

gat 
Asp 

gee 
Ala 

ggc 

Gly 

ttc 
Phe 

tgg 

Trp 
85 
gee 
Ala 



(1041) 



acg 
Thr 

etc 
Leu 

aag 
Lys 

tat 
Tyr 

gtc 
Val 
70 
cag 
Gin 



aag tat cgc ggc gcg 
Lys Tyr Arg Gly Ala 

10 

aca 
Thr 
25 
gtg 
Val 



gac 
Asp 

cag 
Gin 



cgc 
Arg 



gtc 
Val 



ecc 
Pro 
55 
cag 
Gin 



gac 
Asp 
40 
ttt 
Phe 



cgc 
Arg 



gag 
Glu 

ctg 
Leu 



tgg 
Trp 



cgc tat 
Arg Tyr 



ata 
He 

ttc 
Phe 



tgg 
Trp 

gag 
Glu 
75 
gcg 
Ala 



gcg 
Ala 

aaa 
Lys 

ate 
He 

Gtg 
Leu 
60 
aat 
Asn 



atg ggc cag 



gtg 
Val 
100 - 
gcg ate 



gcc ctg gcg gat gcg gcc 
Ala Leu Ala Asp Ala Ala 

90 

ggc tat age gag cgc gcg ggc 
Gly Tyr Ser Glu Arg Ala Gly 

105 

ttc ggc eee gat ggc gat ctg 



ENANTIOMERICALLY PURE 



Organism: Obtained from an 



gtg 
Val 

gcg 
Ala 

gca 
Ala 
45 
ggc 
Gly 



cag 
Gin 

ate 
He 
30 
ttc 
Phe 

gcg 
Ala 



gee gcg 
Ala Ala 

15 
ggc ctg 
Gly Leu 

eca gag 
Pro Glu 

ccg get 
Pro Ala 



teg etc 
Ser Leu 

egc cgc 
Arg Arg 

ggc age 
Gly Ser 
110 
ate gcc 



gtg 
Val 

cac 
His 
95 
etc 
Leu 



cgc 
Arg 
80 
ggc 
Gly 

tat 
Tvr 



gcg cge 



48 



96 



144 



192 



240 



288 



336 



384 
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68 


Met 


Gly 


Gin 


Ala 


lie 


Pne 


Gly 


Pro 


Asp 


Gly 


Asp 


Leu 


lie 


Ala 


Ala 


Arg 




69 






11 D 










121) 










HOC 

12 D 










/ 1 


cgc 


aag 


etc 


aag 


cot 


acc 


Ca u 


gcg 


gag 


cgc 


acc 


gtg 


4- 4- 


ggc 


gag 


gga 


4 0 2 


12 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu 


Arg 


Thr 


Val 


Pne 


Gly 


Glu 


Gly 




/ J 




1 J U 










13 5 










1 / A 

14 U 












/ D 


gac 




age 


cat 


ccc 


gcg 


gtg 


cac 


gat 


ace 


gee 


a lc 


ggg 


cgc 


etc 


ggc 


4 0 U 


7 6 


Asp 


Gly 


Ser 


His 


Leu 


TV 1 

Ala 


Vai 


His 


Asp 


m -B^* 

i nr 


Ala 


1 le 


Gly 


Arg 


T All 

Leu 


Gly 




/ / 


14 D 










T C A 

Ibu 










iOD 










160 




/ y 


gcg 


4— y-^ 

cue 


tgt 


tgc 


tgg 


gag 


cac 


ate 


cag 


cca 


4- 4- /-T 

ttg 


teg 


aaa 


tac 


gcc 


atg 


0 2 0 


80 


•71,-1 

Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


lie 


Gin 


Pro 


Leu 


Ser 


Lys 


Tyr 


Ala 


Met 




81 










155 










1 T A 

170 










175 






o J 


tac 


gcc 


gcc 


gac 


gaa 


cag 


gtc 


cac 


gtc 


gcg 


teg 


tgg 


ccg 


age 


ttc 


age 


c T cr 
3 / b 


84 


Tyr 


Ala 


Ala 


Asp 


Glu 


Gin 


Val 


His 


Val 


Ala 


Ser 


Trp 


Pro 


Ser 


1^ 1^ -.w 

Phe 


Ser 




o c 
O 3 








T 0 A 
1 0 (J 










IOC 

lob 










1 0 A 

1 y u 








O / 


etc 


■f- :3 +- 


cgc 


ggc 


a t.g 


gcc 


tat 


gcg 


etc 


gga 


ccg 


gag 


y^ 4^ 

giLC 


aaii 


acc 


gcc 


0 2 4 


88 


Leu 


Tyr 


Arg 


Gly 


Met 


HI™ 
Ala 


Tyr 


Ala 


T _ _ _ 

Leu 


Gly 


Pro 


Glu 


Vai 


Asn 


Thr 


TV n ~ 

Ala 




89 






1 A C 

19 5 










0 A A 

200 










'1 A C 

20 5 












gca 


age 


cag 


ate 


tac 


9cg 


gtc 


gag 


ggc 


ggc 


tgc 


tac 


gtg 


ctg 


gcg 


teg 


C '7 0 

b / 2 


92 


Ala 


Ser 


Gin 


lie 


Tyr 


Ala 


Val 


Glu 


Gly 


Gly 


Cys 


Tyr 


Val 


Leu 


Ala 


Ser 




y o 




2 lU 










215 










''t A A 

2 20 












y D 


tgc 


gcg 


acc 


g uL 


teg 


ccg 


gag 


atg 


ate 


aag 


gta 


4- 4- j-r 

ttg 


gtg 


gat 


acg 


ccc 


'Ton 
/ 2 U 


y b 


Lys 


Ala 


i nr 


vai 


Ser 


-i-w ^—^ 

PL 0 


Glu 


Me 11 


lie 


Lys 


val 


Leu 


val 


Asp 


1 nr 


Pro 




y / 


J 2 D 










O A 

2 i U 










n 1 

2 3 5 










24 U 




Q C 

y y 


gac 


aag 


9^9 


atg 


4- -1- ^ 

u lC 


etc 


aag 


gee 


ggc 


ggc 


ggt 


■1-4-4- 
L L U 


gee 


atg 


3-1-4- 

a L.U 


4- 4- ^ 
L. UC 


"7 c; Q 
/bo 


inn 
1 U U 


Asp 


Lys 


GlU 


-t- 

Me L 


T~l V-» ^ 

Pile 


Leu 


Lys 


Ala 


Gly 


Gly 


Gly 


n 

Fne 


Ala 


Met 


lie 


D K ^ 

Pile 




JL U X 










z 4 J 










1 c; A 
Z J U 
















± U -3 


ggg 


ccc 


gac 


99c 


cgc 


gcc 


c eg 


gcc 


gag 


ccg 


cue 


ccg 


gag 


acc 


gaa 


gag 


0 1 b 


1U4 


Gly 


Pro 


Asp 


Gly 


Arg 


Ala 


Leu 


Ala 


Glu 


Pro 


Leu 


Pro 


Glu 


Thr 


Glu 


Glu 




lU b 








OCA 










o cr 

265 










OTA 

2/0 








X u / 




c L.g 


c Lg 


gtc 


gcc 


gat 


ate 


gac 


c tc 


ggc 


atg 


ate 


gcg 


ttg 


gcc 


aag 


0 b 4 


lOo 


Gly 


Leu 


Leu 


Vai 


Ala 


Asp 


lie 


Asp 


Leu 


Gly 


Met 


lie 


Ala 


Leu 


TV T 

Ala 


Lys 




luy 






27 5 










0 A A 

280 










A n c 

2 85 










111 


gcg 


gcg 


gcc 


gat 


ccg 


gcg 


ggc 


cac 


tat 


tea 


egg 


ccc 


gac 


gta 


acg 


egg 


912 


112 


Ala 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


Vai 


Thr 


Arg 




1 1 J 




290 










"I A cr 

2 9 5 










0 A A 

300 












11 D 


ctg 


ctg 


ctg 


gat 


ega 


cgt 


ccg 


gcc 


caa 


cgc 


gtc 


gtc 


acg 


^ 4. i_ 
ctt 


gat 


gee 


A A 

9 6 0 


116 


Leu 


Leu 


Leu 


Asp 


Arg 


Arg 


Pro 


Ala 


Gin 


Arg 


Vai 


Val 


Thr 


Leu 


Asp 


Ala 




117 


305 










310 










315 










320 




119 


gca 


ttc 


gaa 


ccg 


caa 


aac 


gag 


gac 


aag 


ggc 


gac 


gcg 


ccc 


gcg 


Ctg 


cgc 


1008 


120 


Ala 


Phe 


Glu 


Pro 


Gin 


Asn 


Glu 


Asp 


Lys 


Gly 


Asp 


Ala 


Pro 


Ala 


Leu 


Arg 




121 










325 










330 










335 






123 


gtg 


gtg 


gcg 


gaa 


age 


gcc 


gcc 


gcc 


gcg 


cag 


tag 












1041 


124 


Vai 


Val 


Ala 


Glu 


Ser 


Ala 


Ala 


Ala 


Ala 


Gin 
















125 








340 










345 


















128 


<210> SEQ ID NO 


: 2 


























129 


<211> LENGTH: 346 


























13 0 


<212> TYPE: 


PRT 




























131 


<213> ORGANISM: 


Unknown 


Organism 


















133 


<220> FEATURE: 
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134 <223> OTHER INFORMATION: Description of Unknown Organism: Obtained from an 

135 <223> OTHER INFORMATION: environmental sample 

137 <400> SEQUENCE: 2 

138 Met Ser Glu Pro Met Thr Lys Tyr Arg Gly Ala Ala Val Gin Ala Ala 

139 1 5 - 10 15 

140 Pro Val Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Ala lie Gly Leu 

141 20 25 30 

142 lie Glu Gin Ala Ala Lys Gin Asp Val Arg Leu lie Ala Phe Pro Glu 

143 35 40 45 

144 Thr Trp lie Pro Gly Tyr Pro Phe Trp lie Trp Leu Gly Ala Pro Ala 

145 50 55 60 

14 6 Trp Gly Met Arg Phe Val Gin Arg Tyr Phe Glu Asn Ser Leu Val Arg 

147 65 70 75 80 

148 Gly Ser Lys Gin Trp Gin Ala Leu Ala Asp Ala Ala Arg Arg His Gly 

149 85 90 95 

150 Met His Val Val Ala Gly Tyr Ser Glu Arg Ala Gly Gly Ser Leu Tyr 

151 100 105 110 

152 Met Gly Gin Ala lie Phe Gly Pro Asp Gly Asp Leu lie Ala Ala Arg 

153 115 120 125 

154 Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly 

155 130 135 140 

156 Asp Gly Ser His Leu Ala Val His Asp Thr Ala lie Gly Arg Leu Gly 

157 145 150 155 160 

158 Ala Leu Cys Cys Trp Glu His lie Gin Pro Leu Ser Lys Tyr Ala Met 

159 165 170 175 

160 Tyr Ala Ala Asp Glu Gin Val His Val Ala Ser Trp Pro Ser Phe Ser 

161 180 185 190 

162 Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala 

163 195 200 205 

164 Ala Ser Gin lie Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser 

165 210 215 220 

166 Cys Ala Thr Val Ser Pro Glu Met lie Lys Val Leu Val Asp Thr Pro 

167 225 230 235 240 

168 Asp Lys Glu Met Phe Leu Lys Ala Gly Gly Gly Phe Ala Met lie Phe 

169 245 250 255 

170 Gly Pro Asp Gly Arg Ala Leu Ala Glu Pro Leu Pro Glu Thr Glu Glu 

171 260 265 270 

172 Gly Leu Leu Val Ala Asp He Asp Leu Gly Met He Ala Leu Ala Lys 

173 275 280 285 

17 4 Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg 

175 290 295 300 

176 Leu Leu Leu Asp Arg Arg Pro Ala Gin Arg Val Val Thr Leu Asp Ala 

177 305 310 315 320 

178 Ala Phe Glu Pro Gin Asn Glu Asp Lys Gly Asp Ala Pro Ala Leu Arg 

179 325 330 335 

180 Val Val Ala Glu Ser Ala Ala Ala Ala Gin 

181 340 345 

185 <210> SEQ ID NO: 3 

186 <211> LENGTH: 1014 
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TOT 

187 


<212> TYPE: 


DNA 




























188 


<213> ORGANISM: 


Unknown 


Organism 


















190 


<220> FEATURE: 




























191 


<223> OTHER 


INFORMATION 


: Description of 


Unknown 


Organism: Obtained from 


192 




environmental 


sample 






















194 


<220> FEATURE: 




























19 5 


<221> NAME/KEY: 


O T\ O 

CDS 


























196 


<222> LOCATION: 


/■ 1 ^ 

( -1- ; 


. . (1014) 






















198 


<400> SEQUENCE: 




























199 


a tg 


aaa 


gaa 


get 


ate 


aag 


gtc 


gcc 


tgc 


gtg 


caa 


gcc 


gcc 


ccg 


ate 


tac 


4 8 


200 


Met 


Lys 


Glu 


Ala 


lie 


-Lys 


V a r 


Ala 


Cys 


Val 


Gin 


Ala 


Ala 


Pro 


lie 


Tyr 




201 


1 








5 










lU 










Id 






203 


a tg 


gat 


t tg 


gag 


gcg 


aeg 


gtg 


gac 


aaa 


acc 


att 


gag 


ttg 


atg 


gaa 


gaa 


96 


204 


Met 


Asp 


Leu 


Glu 


Ala 


i nr 


V a i. 


Asp 


Lys 


Thr 


He 


Glu 


Leu 


Met 


"1 

Glu 


Glu 




205 








20 




















J U 








207 


gca 


gca 


egt 


aat 


aa t 


get 


egt 


ctg 


ate 


gcc 


tt t 


ccg 


gaa 


act 


tgg 


-\ 4- 4- 

att 


14 4 


208 


A t —1 

Ala 


Ala 


Ar g 


Asn 




/l±a 


TV ^ 

Arg 


Leu 


He 


Ala 


^ 

Pne 


Pro 


Glu 


Thr 


Trp 


He 




209 






3 5 










4 U 




















211 


cca 


ggc 


tac 


cca 


tgg 


4-4-4- 
ttt 


*-i4- 4- 


tgg 


^4-4- 

Ctt 


gac 


tea 


cca 


gca 


tgg 


gca 


atg 


192 


212 


Pro 


Gly 


Tyr 


Pro 


irp 


D ll £=1 

jr ne 


Lieu 


Trp 


Leu 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


Met 




213 
























0 u 












215 


caa 


4- -t- +• 
U L. L. 


gta 


cgc 


caa 


ta c 


cat 


gag 


aac 


tea 


4- 4- « 

ttg 


gag 


4- 4- 

ttg 


gat 


ggc 


eet 


24 0 


216 


Gin 


n A 

Pne 


vai 


Arg 


urn 


iyi 


n JL b 


Glu 


Asn 


Ser 


Leu 


Glu 


Leu 


Asp 


oly 


Pro 




217 


D D 










1 u 
























ziy 






aag 


y-^ >T ^-S 

cgc 


a ur 


rca 


gar 


gca 


gee 


aag 


egg 


4- 4~ ™ 

ttg 


gga 


ate 


atg 


gtc 


288 


220 


bin 


Ala 


Lys 


Arg 


lie 


Q 


ASp 


Ala 


Ala 


Lys 


Arg 


Leu 


Gly 


He 


Met 


Val 




221 










Q C 
O D 




















y b 






o o t> 


acc 


etg 




atg 


agr 


gaa 


egg 


gtc 


ggt 


ggc 


acc 


ctt 


tac 


ate 


agt 


cag 


3 3 6 


224 


Thr 


Leu 


Gly 


Met 




Pill 


Ar g 


vai 


biy 


biy 


i nr 


Leu 


Tyi 


He 


IrH. 

Ser 


Gin 




225 








100 










J. U 0 










Tin 
1 1 u 








22 / 


tgg 


ttc 


ata 


ggc 


gat 


aat 


ggt 


gac 


ace 


att 


ggg 


gcc 


egg 


cga 


aag 


ttg 


3 84 


228 


Trp 


Phe 


He 


Gly 


Asp 


Asn 


Gly 


Asp 


fn Us 

Tnr 


He 


Gly 


Ala 


Arg 


Arg 


Lys 


Leu 




22 9 






115 










Iz U 










12 b 










Z J 1 


aaa 


act 


act 


ttt 


gtt 


gaa 


egt 


act 


ttg 


ttc 


ggc 


gaa 


ggg 


gat 


ggt 


tea 


432 


232 


Lys 


Pro 


Thr 


Phe 


Val 


Glu 


Arg 


Thr 


Leu 


Phe 


Gly 


Glu 


Gly 


Asp 


Gly 


Ser 




233 




130 










135 










140 












235 


teg 


eta 


gcg 


gtt 


ttc 


gag 


aeg 


tct 


gtt 


gga 


agg 


ctg 


ggt 


ggc 


tta 


tgc 


480 


236 


Ser 


Leu 


Ala 


Val 


Phe 


Glu 


Thr 


Ser 


Val 


Gly 


Arg 


Leu 


Gly 


Gly 


Leu 


Cys 




237 


145 










150 










155 










160 




239 


tgt 


tgg 


gag 


cac 


ctt 


caa 


ccg 


eta 


aca 


aaa 


tac 


get 


ttg 


tat 


gca 


caa 


528 


240 


Cys 


Trp 


Glu 


His 


Leu 


Gin 


Pro 


Leu 


Thr 


Lys 


Tyr 


Ala 


Leu 


Tyr 


Ala 


Gin 




241 










165 










170 










175 






243 


aat 


gaa 


gag 


att 


cat 


tgt 


gcg 


get 


tgg 


ccg 


age 


ttt 


age 


ctt 


tat 


cct 


576 


244 


Asn 


Glu 


Glu 


He 


His 


Cys 


Ala 


Ala 


Trp 


Pro 


Ser 


Phe 


Ser 


Leu 


Tyr 


Pro 




245 








180 










185 










190 








24 7 


aat 


gcg 


gcg 


aaa 


gcc 


ctg 


ggg 


cct 


gat 


gtc 


aat 


gta 


gcg 


gcc 


tct 


cga 


624 


248 


Asn 


Ala 


Ala 


Lys 


Ala 


Leu 


Gly 


Pro 


Asp 


Vai 


Asn 


Val 


Ala 


Ala 


Ser 


Arg 




249 






195 










200 










205 










251 


ate 


tat 


gcc 


gtt 


gaa 


ggg 


caa 


tgc 


ttc 


gta 


eta 


gcg 


teg 


tgt 


gcg 


etc 


672 
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Output Set: N:\CRF3\01312001\l751299.raw 



252 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Gin 


Cys 


Phe 


Val 


Leu 


Ala 


Ser 


Cys 


Ala 


Leu 




253 




210 










215 










220 












255 


gtt 


tea 


ca a 


tee 


atg 


ate 


gat 


atg 


ctt 


tgt 


aea 


gat 


gac 


gaa 


aag 


eat 


720 


256 


Val 


Ser 


Gin 


Ser 


Met 


He 


Asp 


Met 


Leu 


Cys 


Thr 


Asp 


Asp 


Glu 


Lys 


His 




257 


225 










230 










235 










240 




259 


gcg 


1 J . 

ttg 


ctt 


etg 


get 


ggt 


^ _t_ 

ggt 


gga 


eac 


tea 


egt 


ate 


ata 


ggg 


cct 


gat 


768 


260 


Ala 


Leu 


Leu 


Leu 


Ala 


Gly 


Gly 


Gly 


His 


Ser 


Arg 


He 


He 


Gly 


Pro 


Asp 




261 










245 










250 










255 






263 


ggt 


ggt 


gac 


ttg 


gtc 


gcg 


cct 


ctt 


gcc 


gaa 


aat 


gaa 


gag 


ggt 


att 


etc 


816 


264 


Gly 


Gly 


Asp 


Leu 


Val 


Ala 


Pro 


Leu 


Ala 


Glu 


Asn 


Glu 


Glu 


Gly 


He 


Leu 




265 








260 










265 










270 








267 


tac 


gca 


aac 


ett 


gat 


eet 


gga 


gta 


cgc 


ate 


ctt 


get 


aaa 


atg 


gcg 


gca 


864 


268 


Tyr 


Ala 


Asn 


Leu 


Asp 


Pro 


Gly 


Val 


Arg 


He 


Leu 


Ala 


Lys 


Met 


Ala 


Ala 




269 






275 










280 










285 










271 


gac 


cct 


get 


ggt 


eat 


tat 


tee 


egt 


eee 


gae 


att 


act 


cgc 


ttg 


eta 


ata 


912 


272 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


He 


Thr 


Arg 


Leu 


Leu 


He 




273 




290 










295 










300 












275 


gat 


cgc 


age 


eet 


aaa 


tta 


ccg 


gta 


gtt 


gaa 


att 


gaa 


ggt 


gat 


ctt 


egt 


960 


276 


Asp 


Arg 


Ser 


Pro 


Lys 


Leu 


Pro 


Val 


Val 


Glu 


He 


Glu 


Gly 


Asp 


Leu 


Arg 




277 


305 










310 










315 










320 




279 


cct 


tac 


get 


ttg 


ggt 


aaa 


gcg 


tct 


gag 


acg 


ggt 


gcg 


eaa 


ete 


gaa 


gaa 


1008 


280 


Pro 


Tyr 


Ala 


Lea 


Gly 


Lys 


Ala 


Ser 


Glu 


Thr 


Gly 


Ala 


Gin 


Leu 


Glu 


Glu 




281 










325 










330 










335 






283 


att 


tga 






























1014 


284 


He 


































287 


<210> SEQ ID NO 


: 4 


























283 


<211> LENGTH: 337 


























289 


<212> TYPE: 


PRT 




























290 


<213> ORGANISM: 


Unknown 


Organism 


















292 


<220 


i> FEATURE: 




























293 


<223> OTHER 


INFORMATION: 


Descriptior 


\ of 


Unknown 


Organism: Obtained 


from 


294 


<223 


> OTHER 


INFORMATION: 


environmental ; 


sample 












296 


<400> SEQUENCE: 


4 


























297 


Met 


Lys 


Glu 


Ala 


He 


Lys 


Val 


Ala 


Cys 


Val 


Gin 


Ala 


Ala 


Pro 


He 


Tyr 




298 


1 








5 










10 










15 






299 


Met 


Asp 


Leu 


Glu 


Ala 


Thr 


Val 


Asp 


Lys 


Thr 


He 


Glu 


Leu 


Met 


Glu 


Glu 




300 








20 










25 










30 








301 


Ala 


Ala 


Arg 


Asn 


Asn 


Ala 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 


Thr 


Trp 


He 




302 






35 










40 










45 










303 


Pro 


Gly 


Tyr 


Pro 


Trp 


Phe 


Leu 


Trp 


Leu 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


Met 




304 




50 










55 










60 












305 


Gin 


Phe 


Val 


Arg 


Gin 


Tyr 


His 


Glu 


Asn 


Ser 


Leu 


Glu 


Leu 


Asp 


Gly 


Pro 




306 


65 










70 










75 










80 




307 


Gin 


Ala 


Lys 


Arg 


He 


Ser 


Asp 


Ala 


Ala 


Lys 


Arg 


Leu 


Gly 


He 


Met 


Val 




308 










85 










90 










95 






309 


Thr 


Leu 


Gly 


Met 


Ser 


Glu 


Arg 


Val 


Gly 


Gly 


Thr 


Leu 


Tyr 


He 


Ser 


Gin 




310 








100 










105 










110 








311 


Trp 


Phe 


He 


Gly 


Asp 


Asn 


Gly 


Asp 


Thr 


He 


Gly 


Ala 


Arg 


Arg 


Lys 


Leu 




312 






115 










120 










125 











file://C:\Crf3\Outhold\VsiI75l299.htni 



1/31/01 



Page 6 of 7 



VERIFICATION SUMMARY DATE: 01/31/2001 

PATENT APPLICATION: US/0 9/7 51,299 TIME: 11:49:56 

I 

Input Set : A:\Pto, amc 

Output Set: N:\CRF3\01312001\I751299.raw 

L:12 M:270 C: Current Application Number differs/ Replaced Current Application Number 
L;13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 



file://C:\Crf3\Outhold\Vsd75 1 299.htm 



1/31/01 



Page 1 of 7 



OIPE 



RAW SEQUENCE LISTING DyVrE : 
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01/23/2001 

Does Not Comply 
Corrected Diskette Needed 



3 <110> APPLICANT: Madderi, Mark 

4 V^ei.ner, David P. 

5 Chaplin, Jennifer A. 

'? <120> TITLE OF I^^VENTION: METHODS FOR PRODUCING ENANTIOMERICALLY PURE 
3 ALPHA -SUBSTITUTED CARBOXYLIC ACIDS 





10 


<13 0> 


FILE 


REFERENCE: D1VER1440 


-2 


















C-- > 


12 


<140> 


CURRENT APPLICATION NUMBER: US/09/751,299 












C-- > 


13 


<141> 


CURRENT FILING DATE: 


2000 


-12- 


29 


















15 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


6 0/ 


254 , 414 
















16 


<151> 


PRIOR 


FIIjING 


DATE 


: 2000-12-07 




















18 


<15 0> 


PRIOR 


APPLICATION 


NUMBER: 


60/173, 


609 
















:i 9 


<151> 


PRIOR 


PILING 


DATE 


: 1999-12 -20 




















21 


<160> 


NUMBER OF 


SEQ 


ID 1 


NOS : 


4 






















23 


<170> 


SOFTWARE: 


Patentin Ver. 2 


. 1 




















25 


<2.10> 


SEQ ID NO 


: 1 




























26 


<211> 


LENGTH: 1041 




























27 


<212> 


TYPE: 


DNA 






























28 


<213> 


ORGANISM: 


Unknown 


Organism 




















30 


<220> 


FEATURE: 






























31 


<223> 


OTHER 


INFORMATION 


: Descr.iption of 


Unknown 


Organism: Obtained from 




3 2 






environmenta 1 


sample 
























34 


<220> 


FEATURE: 






























35 


<221> 


NAME/KEY : 


CDS 




























36 


<222> 


loc:atton : 


<1) 


. . (1041) 
























38 


<400> 


SEQ LIE! 


"iCE: 


1 




























39 


attj 


teg 


gag 


CCC 


atg 


a eg 


aag 


tat 


cgc 


ggc 


gcg 


gcg 


gtg 


cag 


gee 


gcg 


48 




4 0 


Met 


Ser 


Glu 


Pro 


Met 


Thr 


Lys 


Tyr 


Arg 


Gly 


Ala 


Ala 


Val 


Gin 


Ala 


Ala 






41 


X 










5 










10 










15 








4 3 


ccg 


gtg 


ttc 


etc 


gat 


etc 


gac 


cgc 


cjca 


gtc 


9^9 


a a a 


gcg 


ate 


ggc 


ctg 


96 




4 4 


Pro 


Va 1 


Phe 


Leu 


Asp 


Leu 


Asp 


Arg 


Thr 


Val 


Glu 


Lys 


Ala 


He 


Gly 


Leu 






4 5 










20 










25 










30 










47 


ate 


gag 


cag 


gcg 


gcc 


aag 


cag 


gac 


gtg 


cgc 


Ctg 


a tc 


gca 


ttc 


cca 


gag 


144 




48 


lie 


Glu 


Gin 


Ala 


Ala 


Lys 


Gin 


Asp 


Val 


Arg 


Leu 


He 


Ala 


Phe 


P.ro 


Glu 






49 








3 5 










40 










^15 












51 


act 


tgg 


att 


CCC 




tat 


ccc 


ttt 


tgg 


ata 


tgg 


ctg 


ggc 


gcg 


ccg 


get 


192 




52 


Thr 


Trp 


He 


P.ro 


Gly 


Ty.r 


P ro 


Phe 


T.rp 


lie 


Trp 


Leu 


Gly 


Ala 


Pro 


Ala 






5 3 




50 










55 










60 














5 5 


tgg 


99 c 


atg 


cgc 


tLc 


gtc 


cag 


cgc 


tat 


ttc 


gag 


aa t 


teg 


etc 


gtg 


cgc 


240 




56 


Trp 


Gly 


Met 


Arg 


Phe 


val 


Gin 


Arg 


Tyr 


Phe 


Glu 


Asn 


Ser 


Leu 


Val 


Arg 






57 


65 












70 










75 










80 






59 


99C 


age 


aag 


cag 


tgg 


cog 


gcc 


etg 


gcg 


gat 


gcg 


gcc 


cgc 


cgc 


cac 


ggc 


288 




60 


G.ly 


Ser 


Lys 


G].n 


Trp 


Glu 


Ala 


Lea 


Ala 


Asp 


Ala 


Ala 


Arg 


Arg 


H j.s 


Gly 






61 












85 










90 










95 








6 3 


atg 


ca t 


gUc 


gtg 


gcc 


99<^ 


tat 


ago 


gag 


cgc 


gcg 


ggc 


99 c 


age 


etc 


ta t 


3 36 




6 4 


Met 


His 


Va.l 


Val 


Ala 


Gly 


Tyr 


Ser 


Glu 


Arg 


Ala 


Gly 


Gly 


Ser 


Leu 


Tyr 






65 










100 










105 










110 










67 


atg 


ggc 


cag 


ycg 


ate 


ttc 


ggc 


eec 


gat 


ggc 


gat 


ctg 


ate 


gcc 


gcg 


cgc 


3 84 
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6 8 
69 
71 
72 
73 
75 
76 
77 
79 
0 0 
81 
83 
84 
85 
87 
88 
89 
91 
9 2 
93 
95 
96 
97 

9 9 
100 
101 
103 

;i,04 

105 
107 

10 8 
10 9 
111 
112 

;i 13 

115 
116 
117 

;t 19 

120 
121 
123 
124 
125 
12 8 
129 
130 
131 
> 132 



Met Gly Gin Ala lie Phe Gly Pro Asp Gly Asp Leu lie Ala Ala Arg 

115 120 12 5 

cqc aac] etc aatj cot 'dCQ cat qcg gaq cqc acc qtg tt:c gcjc qacj cjga 
Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly 

130 135 140 

gac ggc age cat etc gcg gtg cac gat acc gcc ate ggg cgc etc ggc 
Asp Gly Ser His Leu Ala Val Hi.s Asp Tlir Ala lie Gly Arg Leu Gly 
14 5 ■ 150 155 160 

gcg etc tgt tgc tgg gay cac ate cag cca ttg teg aaa tac gee atg 
Ala Leu Cys Cys Trp Glu His lie Gin Pro Leu Ser Lys Tyr Ala Met 

165 170 175 

tac gcc gee gac gaa cag gtc cac ytc gcg teg tgg ecg age ttc age 
Tyr Ala Ala Asp Glu Gin Val uis Val Ala Ser Trp Pro Ser Phe Ser 

180 185 190 

etc tat ege ggc atg gee tat gcg etc gga ecg gag ytc aat acc gcc 
Leu Tyr Az'g Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala 

195 200 205 

gca age cag ate tac gcg gtc gag ggc gge tgc tac qtg otg gcg teg 
Ala Ser Gin lie Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser 

210 215 220 

tgc qeg ace gtt teg ecg gag atg ate aag gta ttg gtg gat a eg ccc 
Cys Ala Thr Val Ser Pro Glu Met lie Lys Va J. Leu Val Asp Thr Pro 
225 230 235 240 

gac aag gag atg ttc etc aag gee ggc ggc ggt ttt yec atg att ttc 
Asp Lys Glu Met Phe Leu Lys Ala Gly Gly Gly Phe Ala Met Tie Phe 

245 250 255 

ggg ccc gac ggc cgc gcc ctg gcc gag ecg ct:c ecg gag acc gaa gag 
Gly Pro Asp Gly Arg Ala Leu Ala Glu Pro Leu Pro Glu Thr Glu Glu 

260 265 270 

gga ctg ctg gtc gcc gat ate gac etc ggc at,g ate gcg ttg gcc aag 
Gly Leu Leu val Ala Asp Tie Asp Leu Gly Met He Ala Leu Ala Lys 

275 280 285 

gcg gcg gcc gat ecg gcg ggc cac tat tea egg ccc gac gta acg egg 
Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg 

290 295 300 

ctg ctg ctg gat cga cgt ecg gcc caa cgc gtc gtc acg ctt gat gcc 
Leu Leu Leu Asp Arg Arg Pro Ala Gin Arg Val Val Thr Leu Asp Ala 
305 310 315 320 

gca tf.c gaa cog caa aac gag gac aag ggc gac gcg ccc gcg ctg cgc 
Ala Phe Glu Pro Gin Asn Glu Asp Lys Gly Asp Ala Pro Ala Leu Arg 

325 330 335 

gtg gtg gcg gaa age gcc gcc gcc gcg cag tag 
VaL Val Ala Glu Ser Ala Ala Ala Ala Gin 

340 345 
<210> SEQ ID NO. 2 
<211> LENGTH; 34 6 
<212> TYPE: PRT 

ORGANISM: Unknown Organism 
FEATURE : 



4 32 



480 



5 28 



5 76 



62 4 



672 



'20 



768 



816 



864 



912 



960 



1008 



1041 




1 
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132 <22 3> OTHER 1 NFOPJ-IATiON : Description of unknown Organism: Obtained irom an 
135 <'I00> SEQUENCE: 2 

13G Met Ser GIu Pro Met Thr Lys Tyr Arq Gly Ala Ala Val Gin Ala Ala 

137 1 5 10 15 

138 Pro val Phe Leu Asp Leu Asp Ary Thr Val Glu Lys Ala He Gly Leu 

139 20 25 30 

14 0 Tie G.lu Gin Ala Ala Lys Gin Asp val Arq Leu He Ala Phe Pro Glu 

141 3 5 40 45. 

14 2 Tlir Trp He Pro Gly Tyr Pro Phe Trp Tie Trp Leu Gly Ala Pro Ala 

143 50 55 60 

144 Trp Gly Met Ary Phe val Gin Arg Tyr Phe Glu Asn Ser Leu Val Arg 

145 65 7 0 7 5 00 
14 6 Gly Ser Lys Gin Trp Gin Ala Leu Ala Asp Ala Ala Arg A^rg His Gly 
14 7 8 5 9 0 95 

14 8 Met His Val Val Ala Gly Tyr Ser Glu Arg Ala Gly Gly Ser Leu Tyr 

149 100 105 110 

150 Met Gly Gin Ala He Phe Gly Pro Asp Gly Asp Leu He Ala Ala Arg 

151 115 120 125 

152 Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly 

153 130 135 140 

154 Asp Gly Ser His Leu Ala Val His Asp Thr Ala T].e Gly Arg Leu Gly 

155 145 150- 155 160 
15G AJa Leu Cys Cys Trp Glu His He Gin Pro Leu Ser Lys Tyr Ala Met 

157 165 170 175 

158 Tyr Ala Ala Asp Glu Gin Val His Val Ala Ser Trp Pro Ser Phe Ser 

159 180 18 5 19 0 

160 Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala 
.161 195 200 205 

162 Ala Ser Gin He Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser 

163 210 215 220 

164 Cys Ala Thr Val Ser Pro Gin Met He Lys Val Leu val Asp Thr Pro 

165 225 230 235 240 

166 Asp Lys Gin Met Phe Leu Lys Ala Gly Gly Gly Phe Ala Met He Phe 

167 245 250 255 
163 Gly Pro Asp Gly Arg Ala Leu Ala Glu Pro Leu Pro Glu Thr Glu Glu 
169 260 265 270 

17 0 Gly Leu Leu Val Ala Asp He Asp Leu Gly Met He Ala Leu Ala Lys 

171 275 280 285 

172 ?^la Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg 

173 290 295 300 

174 Leu Leu Leu Asp Arg Arg Pro Ala Gin Arg Val Val Thr Leu Asp Ala 

175 305 310 315 320 
17 6 Ala Phe Glu Pro Gin Asn Glu Asp Lys Gly Asp Ala Pro Ala Leu Arg 

177 325 330 335 

178 Val Val Ala Glu Ser Ala Ala Ala Ala Gin 

179 34 0 34 5 

183 <210> SEQ ID NO: 3 

184 <21i> LENGTH: :1014 

185 <212> TYPE: DNA 
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1 o cr 
1 b D 


<213> ORGANISM: 


Unknovm 


Orga[iism 


















1 c o 


<2 2 


0> FEATURE; 




























iby 


<2 2 3> OTHER 


INFOm^ATION 


: Description of 


Unknown 


Organism: Obtained from 


1 \i U 




(i n V i. r on ni e n t a 1 


sample 






















.1 y ^ 


<2 2 


0> fEATURK: 




























TOT 

i y 


<22l> WAME/KKY : 


L JJo 


























1 y '1 


<222> LOCATION: 


( 1 \ 


. . (1014 ) 






















1 0 

1 y u 


<4 00> SBOUftlNCf;: ; 


•J 


























TOT 

1 y / 


0 Ly 


%\ ;i ^1 


' J Cl <J 


gcL 


d L L 


aag 


y tc 


gcc 


tgc 


gtg 


caa 


gcc 


gcc 


ccg 


ate 


ta c 


A Q 
'I O 


ly o 




r ■« J- i.- 
1-1 /s> 


bXU 




J- X tl 


Liy s 


V U X 


AX a 


ey s 


va X 


ux n 


Ala 

Hxa 


A 1 -1 

AX a 


L'^ro 


lie 


Til / 

Tyr 




i y y 


I 


















X u 










X J 






^ U i 


Cl L' J 


gd c 


t 

X. Lg 


ga g 


gcg 


a c (-1 


gt g 


gae 


aa a 


acc 


'1 4- 1-- 

d t L 


gtig 


ttg 


d Lg 


gaa 


gaa 


J o 


T ri '*> 




Asp 




fill 


a 1 a 

ri J.« 


1 11 X 


Val, 


Asp 


Jjys 


J nr 


J. le 


1 1 1 

h J.U 


Leu 


wet 


1,j1 U 


(jIU 




^ U J 




























J u 








20 5 




gca 


eg t 


a a L 


dd L 


get 


eg L 


c tig 


ate 


gcc 


Lt L 


ccg 


gaa 


dC L 


tgg 


-1 1- 

at L 


X 't '1 


f^ r 
Z U O 


TV "1 -1 

Aid 


Aid 


A rg 


As n 


A 


AX a 


Arg 


Leu 


lie 


A J,d 


l'^ 1-1 r-1 

Pne 


Pro 


/""111 

Glu 


1 IlT 


J rp 


lie 




OAT 
Z U / 
















1 n 

4 u 




















2 uy 




ggc 


T_ac 


oca 


tgg 


t Lt 


ctt 


tgg 


c tt 


gac 


tea 


CCa 


gca 


tgq 


gca 


a tg 


T f 1 ''1 

X y ^ 


210 


Pro 


ii J.y 


Tyr 


Pro 


J. r p 


ri K --y 

H lie 


Leu 


Trp 


Lei] 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


Met 




nil 




J U 










J 5 






















2 1 S 


C ci3 


1- 1- h 

LUC 


g La 


eg c 


cd a 


tae 


Cd t 


gag 


aa c 


tea 


ttg 


gag 


ttg 


g t 


99 <- 


cot 


240 


2 14 


b i. ri 


Pile 


Vd 1. 


A-C'Q 


\.l X 11 


J. y r 


H 1 b 


G.l u 


Asn 


Ser 


Leu 


bill 


Leu 


Asp 


Gly 


Pro 




Z J. D 












/ 0 










/ J 










o u 




217 




^ C L 


ddg 


/'I 

eg L 


d Lt 


tea 


gat 


gca 


gcc 


aag 


egg 


ttg 


gga 


dLC 


-1 

dug 


gtc 


'■> o o 
zoo 


2 1 o 


o J.n 


Ala 


Lys 


Arg 


Tin 


oer 


AS p 


TV '1 -1 

Aid 


Al a 


Lys 


Al g 


Leu 


oly 


I !.e 


Met 


Vd J. 




z ly 




















o n 
J u 










y D 








n 


c Lg 




a tg 


ag t 


gaa 




gtc 


ggt 


99 c 


acc 


Ctt 


tac 


ate 


ag t 


eag 


O O 


2 22 


I J 1,1 


J-jG u 


X / 


I'le L 




ij .1. U 


Arg 


va J. 


Gly 


Cj 1 y 


Thr 


Leu 


Tyr 


T 1 - , 
I Xti 


Ser 


Gin 




n O ''j 
Z Z J 








.1 00 










.1. U D 










Tin 

XI u 








'1 c 

2 ZD 


tgg 


ttc 


ata 


ggc 


gat 


aa t 


ggt 


ga c 


acc 


-t -4- 4- 

att 


9g9 


gcc 


egg 


ega 


aag 


4- 4~ y-4 

ttg 


3o4 


z i u 


Trp 


Phe 


IlG 


Gly 




Asn 


Giy 


Asp 


1 nr 


11 e 


Gly 


AX a 


Arg 


Arg 


Lys 


IiCU 




0 O 7 
Z z / 






115 










X z u 










.1 Z -) 










o •^1 ri 


a a a 


cct 


act 
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PATENT APPLICATION; US/09/751,299 



DATE: 01/2 3/2001 
TIME: 10:22:43 



Input: Set : A:\Sequence DIVERi44 0-2 
OutpuL Set: N:\CRF3\01232001\l751299.raw 
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Input Set : A:\Sequence DIVERi44 0-2 
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L:12 14:270 C: Current Application Number dilfers, Replaced Current Application Number 
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